AITopics | bastien bubeck

7a43ed4e82d06a1e6b2e88518fb8c2b0-Paper.pdf

Neural Information Processing SystemsFeb-19-2026, 03:17:38 GMT

When n is independent ofδ our approach yields an algorithm whose sample complexityconvergesto 2n2 log 1δ asngrows.

algorithm, artificial intelligence, log 1, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Georgia > Fulton County > Atlanta (0.05)
North America > United States > District of Columbia > Washington (0.05)
Asia > Middle East > Israel > Haifa District > Haifa (0.05)
(6 more...)

Technology: Information Technology > Artificial Intelligence (0.47)

Add feedback

Bounded Regret for Finite-Armed Structured Bandits

Tor Lattimore, Remi Munos

Neural Information Processing SystemsFeb-12-2025, 00:59:27 GMT

We study a new type of K-armed bandit problem where the expected return of one arm may depend on the returns of other arms. We present a new algorithm for this general class of problems and show that under certain circumstances it is possible to achieve finite expected cumulative regret. We also give problemdependent lower bounds on the cumulative regret showing that at least in special cases the new algorithm is nearly optimal.

data mining, finite regret, machine learning, (20 more...)

Neural Information Processing Systems

Country:

North America > Canada > Alberta (0.14)
Oceania > Australia (0.04)
Europe > France > Hauts-de-France > Nord > Lille (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.51)

Add feedback

Best-Arm Identification in Linear Bandits

Marta Soare, Alessandro Lazaric, Remi Munos

Neural Information Processing SystemsFeb-10-2025, 00:11:18 GMT

We characterize the complexity of the problem and introduce sample allocation strategies that pull arms to identify the best arm with a fixed confidence, while minimizing the sample budget. In particular, we show the importance of exploiting the global linear structure to improve the estimate of the reward of near-optimal arms. We analyze the proposed strategies and compare their empirical performance. Finally, as a by-product of our analysis, we point out the connection to the G-optimality criterion used in optimal experimental design.

artificial intelligence, data mining, machine learning, (19 more...)

Neural Information Processing Systems

Country: Europe > France > Hauts-de-France > Pas-de-Calais (0.04)

Genre: Research Report (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.47)

Add feedback

Best-Arm Identification in Linear Bandits

Neural Information Processing SystemsMar-13-2024, 14:01:36 GMT

We characterize the complexity of the problem and introduce sample allocation strategies that pull arms to identify the best arm with a fixed confidence, while minimizing the sample budget. In particular, we show the importance of exploiting the global linear structure to improve the estimate of the reward of near-optimal arms. We analyze the proposed strategies and compare their empirical performance. Finally, as a by-product of our analysis, we point out the connection to the G-optimality criterion used in optimal experimental design.

allocation, allocation strategy, complexity, (15 more...)

Neural Information Processing Systems

Country: Europe > France > Hauts-de-France > Pas-de-Calais (0.04)

Genre: Research Report (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.47)

Add feedback

Bounded Regret for Finite-Armed Structured Bandits

Neural Information Processing SystemsMar-13-2024, 11:46:53 GMT

We study a new type of K-armed bandit problem where the expected return of one arm may depend on the returns of other arms. We present a new algorithm for this general class of problems and show that under certain circumstances it is possible to achieve finite expected cumulative regret. We also give problemdependent lower bounds on the cumulative regret showing that at least in special cases the new algorithm is nearly optimal.

algorithm, finite regret, theorem 3, (16 more...)

Neural Information Processing Systems

Country:

North America > Canada > Alberta (0.14)
Oceania > Australia (0.04)
Europe > France > Hauts-de-France > Nord > Lille (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.51)

Add feedback

053r - Chasing convex bodies and other random topics with Dr. Sébastien Bubeck

#artificialintelligenceDec-31-2019, 11:23:08 GMT

Dr. Sébastien Bubeck is a mathematician and a senior researcher in the Machine Learning and Optimization group at Microsoft Research. He's also a self-proclaimed "bandit" who claims that, despite all the buzz around AI, it's still a science in its infancy. That's why he's devoted his career to advancing the mathematical foundations behind the machine learning algorithms behind AI. Today, Dr. Bubeck explains the difficulty of the multi-armed bandit problem in the context of a parameter- and data-rich online world. He also discusses a host of topics from randomness and convex optimization to metrical task systems and log n competitiveness to the surprising connection between Gaussian kernels and what he calls some of the most beautiful objects in mathematics.

bastien bubeck, data mining, machine learning, (4 more...)

#artificialintelligence

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Best-Arm Identification in Linear Bandits

Soare, Marta, Lazaric, Alessandro, Munos, Remi

Neural Information Processing SystemsDec-31-2014

We study the best-arm identification problem in linear bandit, where the rewards of the arms depend linearly on an unknown parameter $\theta^*$ and the objective is to return the arm with the largest reward. We characterize the complexity of the problem and introduce sample allocation strategies that pull arms to identify the best arm with a fixed confidence, while minimizing the sample budget. In particular, we show the importance of exploiting the global linear structure to improve the estimate of the reward of near-optimal arms. We analyze the proposed strategies and compare their empirical performance. Finally, as a by-product of our analysis, we point out the connection to the $G$-optimality criterion used in optimal experimental design.

artificial intelligence, data mining, machine learning, (19 more...)

Neural Information Processing Systems

Genre: Research Report (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.47)

Add feedback

Bounded Regret for Finite-Armed Structured Bandits

Lattimore, Tor, Munos, Remi

Neural Information Processing SystemsDec-31-2014

We study a new type of K-armed bandit problem where the expected return of one arm may depend on the returns of other arms. We present a new algorithm for this general class of problems and show that under certain circumstances it is possible to achieve finite expected cumulative regret. We also give problem-dependent lower bounds on the cumulative regret showing that at least in special cases the new algorithm is nearly optimal.

data mining, finite regret, machine learning, (20 more...)

Neural Information Processing Systems

Country: North America > Canada > Alberta (0.14)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.51)

Add feedback

Filters

Collaborating Authors

bastien bubeck

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

7a43ed4e82d06a1e6b2e88518fb8c2b0-Paper.pdf

Bounded Regret for Finite-Armed Structured Bandits

Best-Arm Identification in Linear Bandits

Best-Arm Identification in Linear Bandits

Bounded Regret for Finite-Armed Structured Bandits

053r - Chasing convex bodies and other random topics with Dr. Sébastien Bubeck

Best-Arm Identification in Linear Bandits

Bounded Regret for Finite-Armed Structured Bandits